Policy Learning With Observational Data

نویسندگان

چکیده

In many areas, practitioners seek to use observational data learn a treatment assignment policy that satisfies application‐specific constraints, such as budget, fairness, simplicity, or other functional form constraints. For example, policies may be restricted take the of decision trees based on limited set easily observable individual characteristics. We propose new approach this problem motivated by theory semiparametrically efficient estimation. Our method can used optimize either binary treatments infinitesimal nudges continuous treatments, and leverage where causal effects are identified using variety strategies, including selection observables instrumental variables. Given doubly robust estimator effect assigning everyone treatment, we develop an algorithm for choosing whom treat, establish strong guarantees asymptotic utilitarian regret resulting policy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning from observational data with prior knowledge

ion Domain layer Inference layer Task layer Strategy layer Figure 2: Degree of abstraction theoretical view When comparing the layers according to degree of abstraction, the above ordering suggests the idea of Fig. 2. However, the design of the strategy layer is, in practice, tightly bound with the problem domain, and can be viewed as more specific than the inference and task layers. Fig. 3 is ...

متن کامل

Learning Optimal Policies from Observational Data

Choosing optimal (or at least better) policies is an important problem in domains from medicine to education to finance and many others. One approach to this problem is through controlled experiments/trials but controlled experiments are expensive. Hence it is important to choose the best policies on the basis of observational data. This presents two difficult challenges: (i) missing counterfac...

متن کامل

Inferring interventional predictions from observational learning data.

Previous research has shown that people are capable of deriving correct predictions for previously unseen actions from passive observations of causal systems (Waldmann & Hagmayer, 2005). However, these studies were limited, since learning data were presented as tabulated data only, which may have turned the task more into a reasoning rather than a learning task. In two experiments, we therefore...

متن کامل

Learning Deterministic Causal Networks from Observational Data

Previous work suggests that humans find it difficult to learn the structure of causal systems given observational data alone. We show that structure learning is successful when the causal systems in question are consistent with people’s expectations that causal relationships are deterministic and that each pattern of observations has a single underlying cause. Our data are well explained by a B...

متن کامل

Learning Policies for Data Imputation with Guided Policy Search

We explore the relationship between directed generative models and reinforcement learning by developing a new approach to data imputation that combines ideas from both areas. We address data imputation by defining an MDP for which we construct policies parametrized by (reasonably) large neural networks. We then show how to train these policies using a form of (self) Guided Policy Search (Levine...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Econometrica

سال: 2021

ISSN: ['0012-9682', '1468-0262']

DOI: https://doi.org/10.3982/ecta15732